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We investigate regular realizability (RR) problems, which are the prob¬ 
lems of verifying whether the intersection of a regular language - the 
input of the problem - and a fixed language, called a filter, is non-empty. 
In this paper we focus on the case of context-free filters. The algorithmic 
complexity of the RR problem is a very coarse measure of the complexity 
of context-free languages. This characteristic respects the rational dom¬ 
inance relation. We show that a RR problem for a maximal filter under 
the rational dominance relation is P-complete. On the other hand, we 
present an example of a P-complete RR problem for a non-maximal filter. 
We show that RR problems for Greibach languages belong to the class 
NL. We also discuss RR problems with context-free filters that might 
have intermediate complexity. Possible candidates are the languages with 
polynomially-bounded rational indices. We show that RR problems for 
these filters lie in the class NSPACE(log^ n). 


1 Introduction 

The context-free languages form one of the most important classes for formal 
language theory. There are many ways to characterize complexity of context-free 
languages. In this paper we propose a new approach to classification of context- 
free languages based on the algorithmic complexity of the corresponding regular 
realizability (RR) problems. 

By ‘regular realizability’ we mean the problem of verifying whether the inter¬ 
section of a regular language - the input of the problem - and a fixed language, 
called a filter, is non-empty. The filter F is a parameter of the problem. Depend¬ 
ing on the representation of a regular language, we distinguish the deterministic 
RR problems RR(F) and the nondeterministic ones NRR(F), which correspond 
to the description of the regular language either by a deterministic or by a non¬ 
deterministic finite automaton. 
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The relation between algorithmic complexities of RR(F) and NRR(T') is still 
unknown. For our purpose - the characterization of the complexity of a context- 
free language - the nondeterministic version is more suitable. One of the reasons 
for this choice is a rational dominance relation (defined in Section [5]). We 
show below that the dominance relation on filters Fi ^rat ^2 implies the log-space 
reduction NRR(Fi) NRR(F 2 ). So our classification is a very coarse version 
of the well-known classification of CFL by the rational dominance relation (see 
the book [5] for a detailed exposition of this topic). 

Depending on a filter F, the algorithmic complexity of the regular realiz¬ 
ability problem varies drastically. There are RR problems that are complete 
for complexity classes such as L, NL, P, NP, PSPACE |llll) . In [T^] a huge 
range of possible algorithmic complexities of the deterministic RR problems was 
presented. We prove below that for context-free nonempty filters the possible 
complexities are in the range between NL-complete problems and P-complete 
problems. Examples of P-complete RR problems are provided in Section [31 The 
filter consisting of all words provides an easy example of an NL-complete RR 
problem. In this case, the problem is exactly the reachability problem for di¬ 
graphs. The upper bound by the class P follows from the reduction of an arbi¬ 
trary NRR-problem specified by a context-free filter to the problem of verifying 
the emptiness of a language generated by a context-free grammar. We prove it 
in Section |3l 

We will call a context-free language L easy if NRR(L) G NL and hard if 
NRR(L) is P-complete. In Section |3| we present an example of a non-generator 
of the CFLs cone, which is hard in this sense. In SectionjTjwe provide examples of 
easy languages. They cover a rather wide class - the so-called Greibach languages 
introduced in [7]. 

The exact border between hard and easy languages is unknown. Moreover, 
there are candidates for an intermediate complexity of RR problems. They are 
languages with polynomially-bounded rational indices. 

The rational index was introduced in [^. Recall that rational index PL{n) 
of a language L is a function that returns the maximum length of the shortest 
word from the intersection of the language L and a language L(A) recognized 
by an automaton A with n states, provided L(A) D L ^ 0: 

PLin-)= max min{\u\\u € L{A) n L}. (I) 

A-.\QA\='n, L{A)r\L^0 

The growth rate of the language’s rational index is an another measure of the 
complexity of a language. This measure is also related to the rational dominance 
(see Section |S] for details). 

In Section 0 we prove that the RR problem for a context-free filter hav¬ 
ing polynomially-bounded rational index is in the class NSPACE(log^ n). Note 
also that there are many known CFLs having polynomially-bounded rational 
indices m- But the RR problems for these languages are in NL. It would be 
interesting to find more sophisticated examples of CFLs having polynomially- 
bounded rational indices. 




2 Preliminaries 


The main point of our paper is investigation of the complexity of the NRR- 
problem for filters from the class of context-free languages CFL. 

Definition 1. The regular realizability problem NRR(F) is the problem of ver¬ 
ifying non-emptiness of the intersection of the filter F with a regular language 
L{A), where A is an NFA. Formally 

NRR(F) = {A I A is an NFA and L{A) n F ^ 0}. 

It follows from the definition that the problem NRR(A*) for the filter con¬ 
sisting of all words under alphabet A is the well-known NL-complete problem 
of digraph reachability. We will show below that NRR(L) S P for an arbitrary 
context-free filter L. So it is suitable to use deterministic log-space reductions 
in the analysis of algorithmic complexity of the RR problems specified by CFL 
filters. We denote the deterministic log-space reduction by ^log- 

Let us recall some basic notions and fix notation concerning the CFLs. For 
a detailed exposition see [213] . We will refer to the empty word as £. Let A„ 
and An be the n-letter alphabets consisting of the letters {oi, 02 ,..., a„} and 
{ 01 , 02 ,... An} respectively. A well-known example of a context-free language, 
the Dyck language Dn, is defined by the grammar 

S —>■ SS I £ I OlFoi I • • • I anSttn. 

Fix alphabets A and B. A language L C A* is rationally dominated by 
L' C B* if there exists a rational relation R such that L = R{L'), where R{X) = 
(it S A* I e A {v,u) G Rj. We denote rational domination as ^rat- We say 
that languages L, L' are rationally equivalent if L ^rat L' and L' ^^at L. 

A rational relation is a graph of a multivalued mapping t/j. We will call the 
mapping t/j with a rational graph as a rational transduction. So L ^rat L' means 
that L — T{i[L'). Such a transduction can be realized by a rational transducer 
(or finite-state transducer) T, which is a nondeterministic finite automaton with 
input and output tapes, where £-moves are permitted. We say that u belongs to 
Tiv) if for the input v there exists a path of computation on which T writes the 
word u on the output tape and halts in the accepting state. Formally, a rational 
transducer is defined by the 6-tuple T = (A, 5, Q, qeiA-, F), where A is the input 
alphabet, B is the output alphabet, Q is the (finite) state set, <70 is the initial 
state, F GQ IS the set of accepting states and i5: Q x (A U £) x (R U £) x Q is 
the transition relation. 

Let two rational transducers Tf and T 2 correspond to rational relations i?i 
and i? 2 , respectively. We say that a rational transducer T = Ti o r 2 is the 
composition of Ti and T 2 if the relation R corresponding to T such that R = 
{{u, v) I 3y{u, y) GRi, {y, v) G R 2 }. 

Define the composition of transducer T and automaton A in the same way: 
automaton B = T o A recognizes the language {w\3y G L{A) {w, y) G i?}. 

The following proposition is an algorithmic version of the Elgot-Mezei theo¬ 
rem (see, e.g., [21 Th. 4.4]). 


Proposition 1 . The composition of transducers and the composition of a trans¬ 
ducer and an automaton are computable in deterministic log space. 

A rational cone is a class of languages closed under rational dominance. 
Let T{L) denote the least rational cone that includes language L and call it 
the rational cone generated by L. Such a cone is called principal. For example, 
the cone Lin of linear languages (see [ 2 ] for definition) is principal: Lin = 7 ”(*S'), 
where the symmetric language S over the alphabet X = {xi, 0:2, xi, 5:2} is defined 
by the grammar 

S —>■ xiSxi I X2SX2 I s. 

For a mapping a ^ La the substitution a is the morphism from A* to the 
power set 2 ® such that a{a) = La. The image cr(L) of a language L C A* is 
defined in the natural way. The substitution closure of a class of languages C is 
the least class containing all substitutions of languages from C to the languages 
from C. We need two well-known examples of the substitution closure. The class 
Qrt of the quasirational languages is the substitution closure of the class Lin. 
The class of Greibach languages [ 7 ] is the substitution closure of the rational 
cone generated by the Dyck language Di and the symmetric language S. 

It is important for our purposes that rational dominance implies a reduction 
for the corresponding RR problems. 

Lemma 1 . If Fi <rat F2 then NRR(Fi) NRR(F2). 

Proof. Let T be a rational transducer such that Fi = T{F2) and let A be an 
input of the NRR(Fi) problem. Construct the automaton B = T o A and use 
it as an input of the NRR(F2) problem. It gives the log-space reduction due to 
Proposition [H 

In particular, this lemma implies that if a problem NRR(F) is complete in 
a complexity class C, then for any filter F' from the rational cone F{F) the 
problem NRR(F'') is in the class C. 

We will use the following reformulation of the Chomsky-Schiitzenberger the¬ 
orem. 

Theorem (Chomsky, Schiitzenberger). CFL = T(D2). 

In the next section, we prove that NRR(L)2) is P-complete under deterministic 
log-space reductions. Thus, it follows from the Chomsky-Schiitzenberger theorem 
and Lemma [T] that any problem NRR(F) for a CFL filter F lies in the class P. 

3 Hard RR problems with CFL filters 

In this section we present examples of hard context-free languages. The first 
example is the Dyck language D2. 

By use of Lemma [Hand the Chomsky-Schiitzenberger theorem, we conclude 
that any generator of the CFL cone is hard. But there are additional hard lan¬ 
guages. We provide such an example, too. 

We start with some technical lemmas. The intersection of a CFL and a ra¬ 
tional language is a CFL. We need an algorithmic version of this fact. 


Lemma 2 . Let G = (N, E, P, S) he a fixed context-free grammar. Then there 
exists a deterministic log-space algorithm that takes a description of an NFA A — 
(Q_a, E, Sa, (lo> constructs a grammar G' = {N\ E, P', S') generating 

the language L{G) n L{A). The grammar size is polynomial in \Qa\- 

This fact is well-known. We provide the proof because the construction will 
be used in the proof of Theorem 0 below. 

Proof (of Lemma First, to make the construction clearer, we assume that 
automaton A has no e-transitions. Let N' consist of the axiom S' and nonter¬ 
minals [qAp], where A G N and q,p G Qa- Construct P' by adding for each rule 
A ^ XiX2 - • ■ Xn from P the set of rules 

{[qAp] [qXiri][riX2r2] - ■ -[rn-iXnp] \ q,p, ri, r2,..., r„_i € Qa} 

to P'. Also add to P' rules [qap] —>• cr if 6 A{q,cr) = p and S' —> [go'S'g/] for each 
qf from Fa- 

Now we prove that L{G') = L{G) fl L{A). Let G derive the word w = 
W1W2 ■ • ■ Wn- Then grammar G' derives all possible sentential forms 

[qoWiri][riW2r2] ■ ■ ■ 

where qf G Fa and n G Qa- And [q'o'u;iri][ri'u;2r2] ■ • ■ [rn-iWnqf] W1W2 ■ ■ - Wn 
iff there is a successful run for the automaton A on ui. If G' derives a word w 
then each symbol Wi of the word has been derived from some nonterminal [qwip]. 
Due to the construction of the grammar G' the word w has been derived from 
some sentential form [qoWiri][riW2r2] ■ ■ • [r-n-iWnqf], which encodes a successive 
run of A on re. Thus G' derives the word w only if G does as well. 

The size of G' is polynomial in Qa- The size of iV' is |iV| • IQa]"^ + 1 - Let k 
be the length of the longest rule in P. Then for each rule from P there are at 
most \Qa\^^^ rules in P' and for rules in the form [qap] —>■ cr or 5 " —>• [qoSqf] 
there are at most 0 {[Qa\'^) rules in P'. 

Finally, the grammar G' is log-space constructible, because the rules of 
P' corresponding to the particular rule from P can be generated by inspect¬ 
ing all {k + l)-tuples of states of A and k = 0 ( 1 ). Adding e-transitions just 
increases fc -|- 1 to 2 k. For each rule A —>• Xi - - - Xn we add rules [qAp] —5> 
[qXiqi][q2X2q3] • ■ • [q2n-iA„p], where qi = qi+i or qi -4 qi+i for all i. In the case 
of [qap] —>■ a rules we add all such rules that q q', p' p and 6 {q', a) = p'. 

Note that if grammar G is in Chomsky normal form, then the number of 
nonterminals of the grammar G' is 0 {[Qa\^)- Recall that for a grammar in the 
Chomsky normal form, the right-hand side of each rule consists of either two 
nonterminals, or one terminal. The empty word may be produced only by the 
axiom and the axiom does not appear in a right-hand side of any rule. 

Also we need an algorithmic version of the Chomsky-Schiitzenberger theorem. 

Lemma 3 . There exists a deterministic log-space algorithm that takes a de¬ 
scription of a context-free grammar G = (A^, E, P, S) and produces a rational 
transducer T such that T{D2) = L{G). 


Now we are ready to prove hardness of the Dyck language £> 2 . 

Theorem 1. The problem NRR(_D 2 ) is 'P-complete. 

Proof. To prove P-hardness we reduce the well-known P-complete problem of 
verifying whether a context-free grammar generates an empty language [B] to 
NRR(£) 2 ). Based on a grammar G, construct a transducer T such that T{D 2 ) = 
L{G) using Lemma [3] Let ^ be a nondeterministic automaton obtained from 
the transducer T by ignoring the output tape. Then L{A) H D 2 is nonempty iff 
L{G) is nonempty. The mapping G —^ is the required reduction. 

To prove that NRR(D 2 ) lies in P we reduce this problem to the problem of 
non-emptiness of a language generated by a context-free grammar. 

For an input A construct the grammar G such that L{G) = L{A) fl £>2 using 
Lemma [2] 

Corollary 1. Any generator of the CFL cone is a hard language. 

Now we present another example of a hard language. Boasson proved in [3] 
that there exists a principal rational cone of non-generators of the CFL cone 
containing the family Qrt of the quasirational languages. 

Below we establish P-completeness of the nondeterministic RR problem for 
a generator of this cone. The construction follows the exposition in [3]. 

For brevity we denote the alphabet of the Dyck language £>i by T = {a, a}*. 
Recall that the syntactic substitution of a language M into a language £ is 


Lf M = {miXim2X2 ■ ■ ■ rUrXr \ rni,..., irir G M, X 1 X 2 • • ■ ccr € £} U ({e} fl £). 

We also use the language ^ t #* which is the syntactic substitution of the 
language ff* in the symmetric language S. 

Let M = aS:y,dL)e. The language is defined recursively in the following 

way: x S iff either x G M or 


X = ayiaziay2az2a ■ ■ ■ 


where yi,yn G X*, yi G for 2 ^ i ^ n — 1, aZiO G and ayiy2 ■ ■ ■ yn<i G 

M. 

Let TTx '. {X U T)* —>■ A* be the morphism that erases symbols from the 
alphabet X. The language is defined to be \Di). 

Finally, we set S\ = U M(+). 

Note that the languages S and are rationally equivalent. So is a 
generator of the cone Lin of the linear languages. 

By combining this observation with Propositions 3.19 and 3.20 from [3], we 
get the following fact. 

Theorem 2. 5'^ is not a generator of the CFL cone, but the cone generated by 
contains all quasirational languages. 


The language is the union of two languages. In the proof of the P- 
completeness for the problem NRR(S'^), we will use automata that do not accept 
words from the language For this purpose we need a notion of a marked 

automaton. 

Definition 2. An NFA A over the alphabet An U A„ is marked if there exists 
a function h: Qa Z satisfying the relations 

h{q') = h{q) + 1, if there exists a transition q q' in A, 

h{q') = h{q) — 1, if there exists a transition q q in A, 

h(q) =0, if <7 is either the initial state or an accepting state of A. 

( 2 ) 

In what follows we will identify for brevity the (directed) paths along the 
graph of an NFA and the corresponding words in the alphabet of the automaton. 
The vertices of the graph, i.e., the states of the automaton, are identified in this 
way with the positions of the word. 

The height of a position is the difference between the number of the sym¬ 
bols Ui and the number of the symbols Oi preceding the position. In terms of 
the position heights, the words in Di are characterized by two conditions: the 
height of any position is nonnegative and the height of the final position is 0. 

Proposition 2. Let A be an NFA such that D 2 H L{A) ^ 0. Then there exists 
a word w € D 2 Cl L{A) ^ 0 such that the height of any position in the word w 

is 0 {\Qa\Y- 

Proof. The heights of positions are upperbounded by the height of the derivation 
tree in the grammar generating the language D 2 n L{A) 0. 

It is easy to see that for any grammar generating a non-empty language there 
is a word such that the height of a derivation tree for the word is at most the 
number of nonterminals in the grammar. 

To finish the proof, we use the grammar constructed by Lemma [5] from 
the grammar generating D 2 in the Chomsky normal form. This grammar has 
0{\Qa\'^) nonterminals. 

In the proof below we need a syntactic transformation of automata over the 
alphabet A 2 L) A 2 . 

Proposition 3. There exists a transformation p that takes a description of an 
automaton A over the alphabet A 2 U A 2 and produces a description of a marked 
automaton A! = p{A) such that (i) L[A) fl £>2 0 iff L{A') fl D 2 0 and (ii) 

for any w € L{A!) the height of any position is nonnegative and the height of the 
final position is 0. The transformation p is computed in deterministic log space. 

Proof. Let m be an upper bound on the heights of the positions in a word 
w £ L{A)riD 2 . By Proposition[2l m is 0(|(5^p) . Note that m can be computed 
in deterministic log space. 


The state set of the automaton A' is Qj^ x {0,..., m} U {r}, where r is the 
specific absorbing rejecting state. 

If q —> q', where a € {oi, 02 }, is a transition in the automaton A then there 
are transitions {q, i) [q', f + 1 ) for all 0 ^ i < m and the transition ( 5 , m) r 
in the automaton A' . 

If g —> q', where a G {ai, 02 }, is a transition in the automaton A then there 
are transitions (g, i) (g', * — 1 ) for all 0 < i ^ m and the transition (g, 0 ) r 
in the automaton A'. 

The initial state of the automaton A' is (go,0), where go is the initial state 
of the automaton A. The set of accepting states of the automaton is F x {0}, 
where F is the set of accepting states of the automaton A. 

It is clear that the description of the automaton A' is constructed in deter¬ 
ministic log space. 

Condition (ii) is forced by the construction of the automaton A' . It remains 
to prove that condition (i) holds. 

Note that if L[A) 0 02 = 0 then L{A!) 0 02=0 too. In the other direction, 
if L{A)002 7 ^ 0, then by Proposition[5] there exists a word w G L{A)0O2 such 
that the height of any position in the word does not exceed m. So the word is 
accepted by the automaton A'. 

Theorem 3. NRR(S'|t) is P-complete under deterministic log space reduetions. 

Proof. We reduce NRR(Il 2 ) to NRR(S'^). 

Let A be an input of the problem NRR(Zl 2 ) and A' = fJ-(A) be the marking 
transformation of the automaton A. 

We are going to construct the automaton B over the alphabet AU X L) {#} 
such that L{A') O O 2 ^ 0 iS L{B) O ^ 0. 

The morphism ip: {A 2 U A 2 )* —>■ (A U X U {#})* is defined as follows: 


ip: ai^ axi, 
p:di^xid##, 
p: 02^ ax2, 
p: 02 X2a##- 

The automaton B accepts words of the form axiX 2 WX 2 Xid, where w = p(u). 
It simulates the behavior of the automaton A! on the word u and accepts iff A' 
accepts the word u. 

It follows from the definitions that A u £ O 2 then axiX 2 p{v)x 2 Xid, G 
So if L{A') 0 02^0 then L{B) O ^ 0. 

Now we are going to prove the opposite implication. Let 

w = axiX 2 p{u)x 2 Xid G fl L(B). 

The automaton A! is marked and B simulates the behavior of A! on u. So the 
heights of positions in w are nonnegative and the height of the final position is 0. 


Thus w ^ \Di). Take a pair of the corresponding parentheses 

a, a in the word w: 

w = woaXiWiXjaw2- 

If i ^ j then w ^ . So i = j for all pairs of the corresponding parentheses. 

This implies m S Z?2 H L(A'). 

We just have proved the correctness of the reduction. It can be computed in 
log space due to the following observations. To produce the automaton B from 
the automaton A we need to extend the state set by a finite number of pre- and 
postprocessing states to operate with the prefix axiX2 and with the suffix X2Xia. 
Also we need to split all states in Q^i in pairs to organize the simulation of A' 
while reading the pairs of symbols axi and Xid. The transitions by the symbol 
^ are trivial: q —> q for all q. 

4 Easy RR problems with CFL filters 

Now we present examples of easy languages. The simplest example is rational 
languages. Next we prove that the symmetric language and the language Di are 
easy. A simple observation shows that a substitution of easy languages into an 
easy language is easy. Thus we conclude that Greibach languages are easy. 

Lemma 4. NRR(S') G NL. 

The proof of Lemma 0 ] is a slight modification of the arguments from [T] that 
prove a similar result for the language of palindromes. 

Lemma 5. Let he a context-free language recognizable by a counter automa¬ 
ton. Then problem NRR(Lc) in NL. 

In the proof we will use the following fact. 

Lemma 6 ([13]). Let M he a counter automaton with n states. Then the short¬ 
est word w from the language L{M) has length at most and the counter of M 
on processing the word w doesn’t exceed the value . 

We now return to the proof of Lemma O 

Proof. Let M be a counter automaton that accepts by reaching the final state 
such that M recognizes the language L^. Let A be an automaton on the input 
of the regular realizability problem. 

Construct the counter automaton with the set of states Qm x Qa^ the 
initial state {q^ .q^), with the set of accepting states Fm x and with the 
transition relation 5 ma such that 5 M(,q,(r,z) h {q',z'), 5 j((p,a) = p' implies 
((g,p), O’,-z) h Hq', p'), z'). This is the standard composition construction. 
The automaton M_4 is a counter automaton with \Qm \ ■ |Q.a| = c x n states. 
Using Lemma |6] we obtain that the value of M^’s counter does not exceed (cn)^ 
on the shortest word from L{Mjf). Then construct automaton B such that L{B) 


contains all such words from L{M_a) such that the counter of does not 
exceed (cn)^. The automaton B has 0{v?) states and can be constructed in log 
space in the straightforward way similar to the proof of Proposition [31 Note that 
L{M^) ^ 0 iff L{B) ^ 0 . So the map B gives a reduction of the problem 
NRR(Lc) to the problem NRR(T'*), which is in NL. 

The language Di is recognized by a counter automaton in the obvious way. 

Corollary 2. NRR(Di) G NL. 

Lemma 7. If L, La for all a G A, are easy languages then a{L) is also easy. 

Proof. Let A be an input for the problem NRR(cr(L)). Define the automaton 
A' over the alphabet A with the state set Qj\' = Qa- There is a transition 
q ^ q' in the automaton A' iff there exists a word w G La such that q ^ q' in 
automaton A. 

It is clear from the definition that L{A) fl cr{L) ^ 0 iff L{A') fl L 7 ^ 0 . To 
apply an NL-algorithm for NRR(L) one needs the transition relation of A'. The 
transition relation is not a part of the input now. But it can be computed by 
NL-algorithms for NRR(La)- It is clear that the resulting algorithm is in NL. 

Applying Lemma [71 Lemma [4] and Corollary |2l we deduce with the theorem. 

Theorem 4. Greibaeh languages are easy. 

5 The case of polynomially-bounded rational index 

We do not know whether there exists a CFL that is neither hard nor easy. 
In this section we indicate one possible class of candidates for an intermediate 
complexity: the languages with polynomially-bounded rational indices. 

Rational index appears to be a very useful characteristic of a context-free 
language because rational index does not increase significantly under rational 
transductions. 

Theorem (Boasson, Courcelle, Nivat, 1981, [5]). If L' ^rat L then there 
exists a constant c sueh that PL'in) ^ cn{pL{cn) - 1 - 1 ). 

Thus the rational index can be used to separate languages w.r.t. the rational 
dominance relation. Note that the rational index of a generator of the CFL cone 
has rather good estimations. 

Theorem (Pierre, 1992, [9]). The rational index of any generator of the 
rational cone o/CFL belongs to exp(0(n^/logn)). 

The examples of easy languages in Section [4] have polynomially-bounded 
rational indices. Moreover, context-free languages with rational index 0{n'^) for 
any positive algebraic number 7 > 1 were presented in m- All of them are easy. 
The proof is rather technical and is skipped here. Thus it is quite natural to 
suggest that any language with polynomially-bounded rational index is easy. 

Unfortunately we are able to give only a weaker bound on the algorithmic 
complexity in the case of polynomially-bounded rational index. 


Theorem 5. For a context-free filter F with polynomially-hounded rational in¬ 
dex, the problem NRR(T) lies in NSPACE(log^ n). 

We use a technique quite similar to the technique from [5]. First we need an 
auxiliary result. 

Lemma ( [8] ). For a grammar G in the Chomsky normal form and for an ar¬ 
bitrary string w = xyz from L{G) of length n there is a nonterminal A in the 
derivation tree, such that A derives y and n/3 ^ |j/| ^ 2nl3. 

Let us return to the proof of the theorem. 

Proof (of Theorem Consider a grammar G' in the Chomsky normal form 
such that L{G') = F. Fix an automaton A with n states such that the minimal 
length of w from L{A)r\F equals prin). The length of the word w is polynomial 
in n. Consider the grammar G such that L{G) = L{A) fl F obtained from the 
grammar G' by the construction from Lemma [5] 

The algorithm does not construct the grammar G itself, since such a con¬ 
struction expands the size of grammar G' up to times. Instead, the algorithm 
nondeterministically guesses the derivation tree of the word w in the grammar 
G, if it exists. Informally speaking, it restores the derivation tree starting from 
its ‘central’ branch. 

The main part of the algorithm is a recursive procedure that checks cor¬ 
rectness for a nonterminal A = [f/A'p] of the grammar G. We say that the 
nonterminal A = [f/A'p] is correct if A produces a word w in the grammar G. 

If a nonterminal is [q(jp\, where ct is a terminal then the procedure should 
check that g p in the automaton A. 

In a general case the procedure of checking correctness nondeterministically 
guesses a nonterminal Ai = such that w = piuisi, and Ai derives 

the word Ui and I/3|w| ^ |mi| ^ 2/3|w|. Then it is recursively applied to the 
nonterminal Ai. If successful the procedure sets i := 1 and repeats the following 
steps: 

1. Nondeterministically guess the ancestor Ai+i = of Ai in the 

derivation tree. There are two possible cases: 

(i) either ^ [q'G'ii+i]Ai in the grammar G (set up G := [q'C 

(ii) or Ai+i Ailn+iG'p'] (set up G := [ri+iG'p']). 

2. Recursively apply the procedure of checking correctness to the nonterminal 
G. 

3. If successful set up i := i-\-l. 

Repetitions are finished and the procedure returns success if Aj = A. If 
any call of the procedure of checking correctness returns failure then the whole 
procedure returns failure. 

In recursive calls the lengths of words to be checked diminish by a factor at 
most 2/3. So the total number of recursive calls is O(logn), where n is the input 
length. Data to be stored during the process form a list of triples (an automaton 


state, a nonterminal of the grammar G', a automaton state). Each automaton 
state description requires O(logn) space and nonterminal description requires 
a constant size space since grammar G' is fixed. Thus the total space for the 
algorithm is O(log^n). 
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